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We discuss the optimal design problem in regression models with long-range dependence error 
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1. Introduction 

Consider the common linear regression model 

y(t) = e 1 f 1 (t) + --. + e p f p (t)+e(t), (i.i) 

where fi(t), . . . , f p (t) are known functions, e(t) is a random error, 6i,...,8 p denote the 
unknown parameters and t is the explanatory variable. We assume that N observations, 
say yi, . . . , j/jv, can be taken at experimental conditions — T < t\ < ■ ■ ■ < < T to es- 
timate the parameters in the linear regression model (1.1). If an appropriate estimate, 
say = (6\, . . . ,8 P ) T , has been chosen, an optimal design minimizes a function of the 
variance-covariance matrix of this estimate, which is called an optimality criterion (see, 
for example, Silvey (1980) or Pukclshcim (1993)). 
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Under the assumption of uncorrelated observations, optimal designs have been studied 
by numerous authors (see the two books cited above and the textbooks of Fedorov (1972), 
Pazman (1986) and Atkinson and Donev (1992)). However, fewer results are available 
for dependent observations, although this problem is of particular interest because in 
many applications, the variable t in the regression model (1.1) represents time and all 
observations correspond to one subject. The reason for this is that optimal experimental 
designs for regression models with correlated observations have a very complicated struc- 
ture and arc difficult to find, even in simple cases. Because explicit solutions are rarely 
available, an asymptotic theory was developed by Sacks and Ylvisaker (1966, 1968). In 
the Sacks- Ylvisaker approach, the design set is fixed and the number of design points in 
this set tends to infinity. As a result of this assumption, the design points become too 
close to each other and the corresponding asymptotic optimal designs depend only on 
the behavior of the correlation function in a neighborhood of the point 0. 

Bickel and Herzberg (1979) and Bickel, Herzberg and Schilling (1981) considered a 
different model, where the design interval expands proportionally to the number of ob- 
servation points and the correlation structure of errors is not used for the construction of 
the least-squares estimate. The variance- covariance matrix of the estimate 6 is of order 
0(1) in the model considered by Sacks and Ylvisaker (1966) and of order 1/N in the 
model discussed by Bickel and Herzberg (1979). Therefore, the approach of Bickel and 
Herzberg makes the optimal designs derived for the dependent and independent cases 
more comparable. These authors assumed that the observations in model (1.1) have a cor- 
relation structure corresponding to a non-degenerate stationary process with short-range 
dependence, where a correlation function p satisfies p(t) = o(l/t) if t — > oo. As examples, 
in Bickel and Herzberg (1979) and Bickel, Herzberg and Schilling (1981), asymptotic 
optimal designs are derived for the linear regression model with and without intercept 
and for the location model. 

The purpose of the present paper is to extend the Bickel-Herzberg approach to the 
case of a stronger dependence of the errors in the linear regression model (1.1), which 
corresponds to an error process with long-range dependence. Long-range dependence is 
observed in many applications, including hydrology, geophysics, turbulence, diffusion, 
economics and finance. The phenomenon was already observed by Pearson (1902) in 
astronomy and by Smith (1938) in agriculture. Further examples where long-range de- 
pendence is discussed can be found in Granger (1980), Mandelbrot (1973), Porter-Hudak 
(1990), Beran, Sherman, Taqqu and Willingcr (1992), Barndorff-Nielsen et al. (1990), 
Beran (1992), Metzler et al. (1999), among many others. The interested reader is re- 
ferred to the books of Beran (1994) and Doukhan et al. (2003), which contain a good 
description of the basic properties of long-range dependence processes and an extensive 
bibliography on this subject. 

Most of the literature considers the estimation problem but - to the best knowledge 
of the authors - design problems for regression models with long-range dependence error 
structure have not been considered thus far. In Section 2, we introduce the basic termi- 
nology and describe the optimal design problem. Our main results are given in Section 3, 
where we derive an asymptotic expression for the variance-covariance matrix, the basis 
for the construction of optimal designs in the regression model (1.1) with a long-range 
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dependent error structure. These results are different from the findings of Kiinsch, Beran 
and Hampel (1993), who considered random explanatory variables. Finally, in Section 4 
several asymptotic optimal designs are derived for the linear regression model and com- 
pared with the results obtained by Bickel and Herzberg (1979) under the assumption of 
a short-range error structure. 

2. Optimal designs for dependent observations 

Consider the linear regression model (1.1), where the error process e(t) is the second-order 
process with 

Ee(i) = 0, Ee(t)e(s) =<r 2 r{t,s), (2.1) 

and assume that 

(CI). The regression functions /i(t), . . . , f p (t) are linearly independent and bounded on 
the interval [—T,T] and satisfy a first order Lipschitz condition, that is, |/»(t) — fi(s) \ < 
M\t-s\ and \fi(t)\ < M for all t,se [— T, T], i = l,...,p. 

Following Bickel and Herzberg (1979), we assume that e(t) = s^(t) + e^(t), where 
e^(t) denotes a stationary process with correlation function p(t) and e^(t) is white 
noise. Consequently, we obtain 

r(t,s)= 1 p(t-s) + (l- 1 )S t>8 , (2.2) 

where 5 is the Kronecker symbol. If N observations, say y = (j/i, . . . , j/at) t , are available 
and the form of the correlation function is known, then the vector of parameters can 
be estimated by the weighted least squares, that is, 9 = (X T Y,~ 1 X)~ 1 X T Y>~ 1 y with 
X T = (/i(£j))i=i'"'p , and the variance-covariance matrix of this estimate is given by 

T>(6) = (x 2 (X T 'E- 1 X)- 1 (2.3) 

with £ = ("fp(ti — tj) + (1 — 7)<5i.j)i.j, i,j — 1,...,N. However, in most applications, 
knowledge about the correlation structure is not available and the unweighted least- 
squares estimate 9 = (X T X)~ 1 X T y is used. For this estimate, the variance-covariance 
matrix is given by 

D(0) = a 2 {X T X)- 1 X T EX(X T X)- 1 . (2.4) 

An experimental design £ = {fi, . . . ,ijv} is a vector of N points in the interval [— T,T], 
which defines the time points or experimental conditions where observations are taken. 
Optimal designs minimize a functional of the variance-covariance matrix of the weighted 
or unweighted least-squares estimate. Following Bickel and Herzberg (1979), we consider a 
correlation function which depends on the sample size N and is of the form /Ojv(t) = p(Nt), 
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where the function p satisfies p(t) — > if t — > oo; this corresponds to expanding the 
interval as the number of observations grows. The standard least-squares estimate is 
considered in the following discussion because when computing this estimate, the form 
of the correlation function p(t) is not used. Despite this, the least-squares estimate often 
has good properties compared to the best linear unbiased estimate; see, for example, 
Adenstedt (1974), Samarow and Taqqu (1988), Yajima (1988, 1991) and Bcran (1994), 
page 179, among others. For some results regarding nonlinear regression, the reader is 
referred to Ivanov and Lconcnko (2004, 2008). 

For our asymptotic investigations, we consider a sequence of designs £n = {tiN, ■ ■ ■ j ^ aw} 
which is generated using a continuous non-decreasing function 

a:[0,l]^[-T,T\ (2.5) 

by 

t iN = a{{i-l)/(N-l)), i = l,...,N, (2.6) 

where the function a(u) is the inverse of a distribution function. Note that the function 
a is obtained as the weak limit of £jv as N — > oo. The equally spaced design corresponds 
to the choice a{u) = (2u— 1)T (u G [0, 1]); changing the function a yields different types 
of the design. For example, the choice a~ 1 (x) = (x 3 + l)/2 yields designs which are more 
concentrated in the interval [—1/2, 1/2]. We assume several regularity conditions on the 
function a, which are required for the asymptotic results which are to follow. More 
precisely: 



(C2) . Let a(u) be twice differentiable and assume that there exists a positive constant 
M < oo such that for all u € (0, 1), 

jj < a'(u) < M, \a"(u)\<M. (2.7) 

(C3). The correlation function p is differentiable with bounded derivative, that is, 
\p'(t)\ < M, t 6 (0,oo) and p'{t) < for sufficiently large t. 

The last assumption implies that p(t) is nonncgativc for sufficiently large t. In contrast 
to Bickel and Hcrzbcrg (1979) we assume that 

/>oo 

/ \p(t)\dt = oo (2.8) 
Jo 

and this assumption corresponds to the long-range dependence of the observations. Note 
that in this case it follows that 



/ |p(i)|dt =£>(*)| =00, 

Jo k=0 
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where p(k) = cav(s^ 1 > (t),e^ {t + k)). The correlation function of a stationary process 
with long-range dependence can be written as 

Pa{t) = W' l<1 ^ 00 ' (2 ' 9) 

where < a < 1 and L(t) is a slowly varying function (SVF) for large t [Doukhan et al. 
(2003)], and satisfies 

p a (t)=0(l/\t\ a ), |*1 -> oo. 
In this case, we will say that p a (t) belongs to the SVF family. 



3. Main results 

First, we introduce two parametric families of correlation functions which are important 
in applications. 

The correlation function p a (t) belongs to the Cauchy family if it is defined by 

where f3>0, < a < 1 [see Gneiting (2000), Aim et al. (2004), Barndorff-Nielsen and 
Leonenko (2005)]. This family includes 

pP(t) = - ; * , pW( t ) = _i_ > p£)(t) 



(l + |i| 2 )"/ 2 ' W ' a W (l+|t|) Q ' 

which have a totally different shape in a neighbourhood of the point t = but the same 
asymptotic behavior for large t (see Figure 1). These three functions are known as the 
characteristic functions of the symmetric Bessel distribution, Linnik distribution and 
symmetric generalized Linnik distribution, respectively. 

The correlation function p a {t) belongs to the Mittag-Lcfflcr family if it is defined by 

p a {t)=E v A-\A a ), E «A-t)=r(P)Y^ T ^ k +py t>0, (3.2) 

where 0<a<l, < v <l, (3>v (see Schneider (1996), Barndorff-Nielsen and Leonenko 
(2005)). This family is a smooth interpolation of long-range dependence (0 < a < 1, 
j3 = 1, < v < 1) and short-range dependence {v = 1, < a < 1, j3 = 1). Note that the case 
!/ = l,j8 = l,a = l corresponds to ordinary diffusion g a (t) = e - '*' , which is the correlation 
function of a Markovian Ornstein-Uhlenbeck process. On the other hand, the case < 
f<l,/3 = l,0<a<l corresponds to subdiffusion or slow diffusion (see Metzler and 




Klaftcr (2000)). In particular, 

Bi,i(-t) = e-*, E h2 (-t) = (1 - e-*)/t, Ei, 3 (-t) = 2(e" 4 - 1 + 1)/* 2 , 



^l/2,l(-t) = 




In the following discussion, we derive optimal designs for the three families of correlation 
functions, which are given by (2.9), (3.1) and (3.2). The function Q(t) = Y^JLi p(jt) pl a Y s 
an important role in the asymptotic analysis by Bickel and Herzberg (1979), but in the 
case of long-range dependence, this function is infinite. For an asymptotic analysis under 
long-range dependence, we introduce the function 

1 N 

Qa(t) = Jim —-^2 Pa (jt), (3.3) 
n^co d a {N) 

3=1 
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where the normalizing sequence is given by 
d a (N)-. 



N^ a , if a < 1 and p a has the form (3.1) or (3.2), 

IniV, if a = 1 and p a has the form (3.1) or (3.2), 

Li^N 1 -", if a < 1 and p a has the form (2.9), 

L(N) IniV, if a = 1 and p Q has the form (2.9), 



and show in Lemma 1 below that the function Q a (t) is well defined. 

Lemma 1. // the correlation function p a {t) belongs to the Cauchy, SVF family or to 
the Mittag-Leffier family with < a < 1, < v < 1, v < f3, (f,/3) ^ (1, 1), iften i/ie /imit m 
^5.5) exists and is given by 

< a < 1, 



where 



if p a (t) belongs to the Mittag-LefHer family, 



rcs-i/)' 

1, otherwise. 
Proof. Define the function 

1 N 

^(*) = ^v)E^(i*) 

and assume that the correlation function p a (t) is an element of the Cauchy family. Since 
the function p a (t) defined in (3.1) is positive and decreasing for < a < 1, we have 

JV 



<*,N(t) = — T — - ■ r^wfl ds + O 



N 1 -* J (l + \st\P) a /P 



l-a 



N i- a J q N a (l/NP + \st/N\?) a /P \N 

dv -of * W 1 + o( 



= Qa(t) + 

For a — 1, we obtain 



o (\vt\P) a /P \N a - Q2 J {l-a)\t\ a \N a ~ a2 
1 



J\J~a—a 2 



l N l l l 

Ql (t) = ^ h7iv g (i + \jtn^ = ^ TtfEly J JTTWW d{st) 
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= lira tt-, — 77 / z ; — r dv = — , 

n \t\]nNj 1 + H 

which completes the proof for the case where p a (*) belongs to the Cauchy family. 

Now assume that the correlation function p a (t) is an element of the Mittag-Leffler 
family. For the sake of brevity, we only consider the case f3 = 1,0 < a < 1,0 < v < 1, all 
other cases being treated similarly. Since 

Ev ' lHtn ~ \t\°T(l-v) 
as * — > oo (see, for example, formula (3.17) in Schneider (1996)), we have, for < a < 1, 

1 N 1 

Observing that 



E»A-\t\) 



for f->oo (sec Djrbashian (1993)), we obtain, for a = 1, 

N 



r(/3) 



iv^oolniV^ IJ Iy |*|r(/?-i/) 

Finally, assume that the correlation function p a (*) is an element of the SVF family. We 
then obtain 

QJt)= lim ; } , [ ^-ds 



L(Nts/N)d(s/N) 



fN 

n^oo L(N)N 1 ~ a 1 ' J N a \st/N\ a 
nl L(Ntv) dv _ f 1 dv 1 



lim 



N^ooJ L(N)\vt\ a J Q \vt\ a (l-a)\t\ a ' 
where we have used Theorem 2.6 from Seneta (1976) in the last line. For a = 1, we have 

,. 1 f N LU) i ,. 1 f N List)/ UN) , 

Qi(t)= lim — / -ds = lim / v ;/ v ; ds 

^ w N^yoo L(N)hiN J 1 \st\ n ]nN J 1 st 

1 f N 1 . 1 f N L(st)/L(N)-1 , 

= lim / - — r ds + lim / v , , ds 

w->oolniV J 1 \st\ N \nN J 1 \st\ 

1 , If 1 L(Nvt)/L(N)-l , 1 

= — + hm / — '-1—? — dv = — , 

|*| N lnNj 1/N \vt\ |*| 
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Next, we find a comfortable asymptotic representation for the main term in the 
variance-covariance matrix of the least-squares estimates. 

Lemma 2. Assume that the correlation function p a (t) belongs to the Cauchy, Mittag- 
Leffler or SVF family, such that 

l 

Q a (a'(t))dt <oc, (3.4) 

and that the regularity conditions (C1)-(C3) in Sections 2 and 3 are satisfied. We have 

1 f 1 

d ^ N 22 fs{UN)fr(tjN)p a (N(t jN - t iN )) = 2 J f s (a(u))f r (a(u))Q a (a'(u))du + o(l) 

as N — > oo for all s,r = 1, . . . ,p, < a < 1. 

Proof. We only give a proof for the correlation function from the Cauchy family and 
< a < 1, the proof for the other cases being similar. We use the notation / = f s , g = f r , 
p = p a and the decomposition 

N a - 2 f(UN)g(t jN )p(N(t 3N - Un)) =Si + S 2 , 

where 

N N 

S 1 =2N a - 2 Y / f(UN)g(U N ) ]T p(N(t jN -t iN )), (3.5) 

i—l 

N N 

S 2 =2N a - 2 J2f(U N ) (9(tjN)-9(tiN))p(N(t jN -t iN )). (3.6) 

i—l 

With the notation ijy = (i — 1)/(N — 1), we obtain from the differentiability of the func- 
tions a and p 

(j - ft* 

p(N(t jN - t iN )) = p(N(a(j N ) - a(ijv))) = p(a'(i N )(j - i)) + " N _ \ > 

where \v\ < M 2 /2. Let r?j denote a sequence such that r^ — > oo slowly as o(7V( 1_Q ^ 3 ) 
and consider the cases i<r^ and i> rjy in (3.5) and (3.6) separately. Note that 



N 



J2 p(N(t jN -t m )) 



j=i+r N 



N 



= J2 p(N(a(j N )-a(i N ))) 



j=i+r N 



Designs under long-range dependence 



1045 



N oo 

<M J2 p({]-i)/M)<MY J P( k l M )=o(N 1 - a ) 

j=i+r N k=r N 

as TV — > oo uniformly with respect to j . where M is a constant and where we have used 
the fact that the function a'(u) is bounded from below and Lemma 1. Similarly, we obtain 

N 

Y (sfejv) - g(t iN ))p(N(tj N - u N )) 

j=i+r N 

oo 

<2MT J2 \p{N(t jN -t iN ))\=o(N 1 - a ) 

as N — > oo uniformly with respect to j because the function g is bounded. This implies 
that 



JV 



i+r N 



S 1 =2N a - 2 ^2f(t iN )g(t iN ) Y p(N(t jN -t iN )) +o(l), 



(3.7) 



i=l 

JV 



j=i+l 



i+r N 



S 2 = 2N a ~ 2 J2 f(tiN) Y (9(tjN) - g(tiN))p(N(t jN - t iN )) + o(l) (3. 

i— 1 

as N — > oo. For the first term on the right-hand side of (3.8), we obtain the estimate 

N i+r N 



S 2 = N a -' 2 



N — tiN)) 



j=i+l 



i+r N 



<2N a - 1 M 2 T J2 \p(N(t jN -t iN ))\ 



j=i+i 

i+r N 



<2N a -Hl 2 T Y, (\p(a'(i N )(j-i))\+M 2 ^^ 

< 2N a ~ 1 M 2 T [Mr n + M 2 r N /N) = o(l) 
as iV — > oo, while the dominating term on the right-hand side of (3.7) is given by 

JV i+r N 



St = N a ~ 2 Y f(tiN)g(ti N ) Y p( N ( f jN - Un)) 



i=l 
N 



j=i+l 



i+rN 



N^YWMtiN) Y p(a'(i N )(j-i)) + o(l) 

i=l j=i+l 
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N 1 

= A^ 1 ]T f(t tN )g(t lN )Q a (a'(i N )) + o(l) = / f(a(u))g(a(u))Q a (a'(u)) du + o(l) 
i=i 70 

as iV — > oo, which proves the assertion of Lemma 2. □ 

Theorem 1. Let the correlation function p a {t) be an element of the Cauchy, Mittag- 
Leffler or SVF family. If (3-4) and the regularity assumptions (C1)-(C3) stated in Sec- 
tions 2 and 3 are satisfied, then we obtain for the variance-covariance matrix of the 
least-squares estimate defined in (2.4) 

^-T7Wp(6) = 2jW~ 1 (a)R a (a)W~ 1 (a) + 0(l/d Q (JV)), 
d a (N) 

where the matrices W and R a are given by 

W(a)=([ / i (a(u))/ 3 -(a(«))d«Y . 

wo / i. J= l 

R a {a) =( f Ma(u))f 3 (a(u))Q a (a'(u))du) P . 



Proof. In view of (2.2), we obtain that 

\ »#j «'=1 / k.l=l 

where X T = (fi(tjN))iZi' "'p an d Un = a ((* — — 1))>* = 1, . • - ,iV. An application 
of Lemma 2 yields 



N w d a (N)N ' v y \d a {N) / 

The assertion of the theorem now follows by inserting these limits into (2.4). □ 

Note that the constant 7 only appears as a factor in the asymptotic variance-covariance 
matrices of the least-squares estimate. Because most optimality criteria are positively 
homogeneous (see, for example, Pukelsheim (1993)), it is reasonable to consider the 
matrix 

W- 1 {a)R a {a)W 1 {a), 



which is proportional to the asymptotic variance-covariance matrix of the least-squares 
estimate. Moreover, if the function a corresponds to a continuous distribution with a 
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density, say <f>, then a'(t) = l/<p(t) and the asymptotic variance- covariance matrix of the 
least-squares estimate is proportional to the matrix 

* a {<f>)=W- 1 {4>)R a {4>)W- 1 ■(</>), 

where the matrices W{4>) and R a {4>) are given by 
W(cf>)=([ T f i (t)f j (t)^(t)dt) 

R a (<f>) =ff T / l (t)/,(t)Q Q (l/^(t))0(i)dt] 

= ^—(fh{t)fm 1+a (t)A 

i--a\j_ T J iij=1 p 

and we have used the representation Q a (t) = c/((l — a)|i| a ) for the last identity. An 
(asymptotic) optimal design minimizes an appropriate function of the matrix ^S a (<p) (for 
classical least-squares estimation). Note that under long-range dependence, the variance- 
covariance matrix of the least-squares estimate converges slower to zero than in the case 
of independent or short-range dependent errors. In the case of short-range dependence, no 
other normalization is necessary, apart from normalizing the variance-covariance matrix. 
Under long-range dependence, an additional factor d a (N)/N is needed. Moreover, it is 
worthwhile to note that under long-range dependence, the asymptotic variance-covariance 
matrix is fully determined by the function Q a {t) and does not otherwise depend on the 
particular correlation function p a (t). In the following section, we discuss several examples 
in order to illustrate the concept. 



4. Examples 

In most cases, the asymptotic optimal designs for the regression model (1.1) have to be 
found numerically; explicit solutions are only possible for very simple models. In this 
section, we consider models with one or two parameters. 



4.1. Optimal designs for linear models with one parameter 

Consider the linear regression model with p = l, that is, y(t) = 8f(t) + e(t) (6 £ M). In this 
case, the problem of minimizing the asymptotic variance-covariance of the least-squares 
estimate reduces to the minimization of the function 

* aip)= (jmmw J p(t)dt 

in the class of all non-negative functions p(t) on the interval [—T,T]. Note that we 
have represented the density 4> by p/ J p(x)dx, which simplifies the calculation of the 
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directional derivatives in the following discussion. Because Q a (t) is strictly convex on 
(0,oo), it follows from Theorem 3.1 in Bickel and Herzberg (1979) that a minimizer, say 
p* (i) , exists and that (f>* (t) = p* (t) / J p* (t) dt is the asymptotic optimal density. For the 
minimizing function p* , we obtain 



d_ 



* Q (p* +e(p-p*)) 



>0 



for all non- negative functions p on the interval [—T,T]. Consequently, the asymptotic 
optimal density should satisfy J p* (t) dt = 1 and 



(f (i) (H a (l/p*(t))-n) + ?) ( P (t) - p* (i)) dt > 



(4.1) 



for all non-negative functions p on the interval [-T, T] , where the function H a : (0, 00) 
IR + is given by 

H a (t) = Q a (t) - tQ' a (t) = I I^/^' < a < X ' 

[2/t, a = l, 



A* = 2 



// 2 WQa(l/p*W)p*(t)dt 

JP(t)p*(t)dt 

f{t)Q a (\/p*{t))p*(t)dt. 



(4.2) 



First, assume that < a < 1. Note that the function H a is strictly decreasing with 
H a (+0) = 00, H a (oo) = and that its inverse is given by 



H-(t) = 

Hence the solution of (4.1) has the form 

1 



1 + a 
t{\ - a) 



l/o 



V*{t)={ 



H-\n-r/P(t)) 



lo, 



1-Q 

1 + a 



(M-r// 2 (t)) 



l/a 



M-t// 2 (<)>0, 
otherwise, 



(4.3) 



where /i is defined by (4.2) and 



r = J f{t)Q a (l/p*(t))p*(t)dt + J f(t)Q' Q (l/p*(t))dt. 



(4.4) 
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Note that r is a solution of the equation J p(t) dt = 1. Indeed, multiplying f 2 (t)H a (l/p*(t)) 
nf 2 (*) — T by p* (t) and integrating with respect to t yields 



f 2 (t)H a (l/p*(t)) P *(t)dt = J (flf(t)-T) P *(t)dt = H J f 2 (t) P *(t)dt-T. 

Now, the definition of H a (t) and (i gives 

f 2 (t)Q a (l/ P *(t))p*(t) dt- J f(t)Q' a (l/p*(t)) dt 
f(t)Q a (l/p*(t))p*(t)dt-T, 

which yields (4.4). Consequently, we have proven the following result. 

Theorem 2. Assume that the correlation function p a (t) is an element of the Cauchy, 
Mittag-Leffter or SVF family. Then, for the one-parameter linear regression model, the 
asymptotic optimal design exists, is absolute continuous with respect to the Lebesgue 
measure and has the density p*(t) defined in (4-3), where \i and r are given by (4-2) and 
(4-4) ■> respectively. 

We now consider two special cases, which are of particular importance. If p = 1 and 
f(t) = 1, then we obtain the location model and the asymptotic optimal density is the 
uniform density, that is, 

P*(t) = (2T' |t| " T ' (4-5) 
1 0, otherwise. 

Similarly, in the linear regression through the origin, we have p=l, f(t) = t and the 
asymptotic optimal density is given by 

r o, ^ \t\ < ^vjji, 

p{t) = \ (rrS (/i _ T/t2) ) 1 " - - Ti 



0, otherwise, 



where 



' (l-a)ftMt)df J P [) 

and a is the parameter of the correlation function. The above formulas are given for 
< a < 1. For a = 1 and f(t) =t, the asymptotic optimal density is the uniform density 
(4.5). The optimal densities for the parameters a = 1/4,1/2,3/4,0.95 and T = 1 are 
displayed in Figure 2. The parameters /i and r and the efficiency of uniform design are 
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-1 -0.8 -0.6 -0.4 -0.2 0.2 0.4 0.6 0.8 1 

Figure 2. Asymptotic optimal design densities for the linear regression through the origin, 



T= 1. 



shown in Table 1 . We observe that the uniform design is rather inefficient for small values 
of the parameter a. The uniform design has a reasonable efficiency only if a is close to 
1. 

It is worthwhile to mention that the asymptotic optimal designs derived thus far depend 
sensitively on the parameter a, which is usually not available before the experiment. 
Because misspecification of this parameter can result in a substantial loss of efficiency of 
the optimal design, we propose the construction of robust designs which are less sensitive 
with respect to such misspecifications. More precisely, we denote by p* a (t) the optimal 
density design for parameter a. Following Dette (1997) or Miiller and Pazman (1998), a 
robust version of the optimality criterion is of the form 

\J> Q (p* ) 



where p* a is the optimal design for the correlation function p a and A is set of possible a 
values specified by the experimenter. A design maximizing is called standardized max- 
imin optimal. Numerical optimization of this function for the set A = {0.1, 0.2, . . . , 0.9} 



Designs under long-range dependence 1051 

Table 1. Parameters of the asymptotic optimal design density for the linear regression through 
the origin and the efficiency of uniform design (4.5), T = 1 (the optimal density for a — 1 is 
p(t) = l/2, -1<*<1) 



a a t vV/i eff un i 



0.05 2.34 1.06 0.67 0.40 

0.25 3.19 0.96 0.55 0.59 

0.50 4.32 0.70 0.40 0.78 

0.75 6.84 0.44 0.25 0.93 

0.95 24.78 0.25 0.10 0.99 



Table 2. Efficiency of the standardized maximin optimal design v\ defined by (4.6) in the 
linear regression through the origin (the correlation structure is given by the SVF family with 
parameter a) 



a 


0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


eff(p*,a) 


0.84 


0.92 


0.97 


0.99 


0.99 


0.97 


0.94 


0.89 


0.84 



shows that standardized maximin optimal design has a density which can be approxi- 
mated by the function 



p\(t) = (5.7275t 2 - 1.16963 - 3.0264£ 4 )_ 



(4.6) 



which is close to the optimal density p* a for a = 0.44. In Table 2, we show the efficiency 
of this design for various values of a. We observe that the design p* A is very efficient for 
all elements in the set A. 



4.2. Linear regression 

Consider the case p = 2, /i(t) = 1, f 2 {t) = t, which corresponds to the linear regression 
model. In this case, the asymptotic variance-covariance matrix is proportional to 



*a(p) 



1 



tp(t) dt 



J tp{t)dt \ 


1 / 




R{p) 


Jt 2 p(t)dt ) 


\ 



tp{t) dt 

I / tp(t) dt I t 2 p(t) dt 



where 



Rip) 



Q a (l/ P it)) P (t)dt / tQ a (l/p(t)) P (t)dt 



\ / tQ a {l/p(t))p(t)dt / t'Q a (l/p(t))p(t)dt 




For a symmetric density, this matrix is diagonal and 

$ a (p)=diag(J Q a (l/p(t))p(t)dt, (j- f2 p( f ) df )2 )■ 

Consequently, the optimal symmetric design for estimating the slope in the linear regres- 
sion has the density (4.3), where ijl and r are defined in (4.2) and (4.4) (this follows from 
the fact that the element in position (2,2) of the matrix ^ a (p) corresponds to the opti- 
mally criterion for the linear regression through the origin). Numerical results indicate 
that the optimal symmetric design for estimating the slope is optimal in the class of all 
(not necessarily symmetric) designs. 

The D-optimal designs for the linear regression model have to be determined numer- 
ically in all cases. Some D-optimal design densities corresponding to the parameters 
a = 1/4, 1/2, 3/4, 0.95 and T = 1 are displayed in Figure 3. 
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4.3. Comparison of optimal designs under long- and short-range 
dependence 

It is of some interest to compare the asymptotic optimal designs under short- and long- 
range dependence. For this purpose, we again consider the linear regression model with 
no intercept. Bickel and Hcrzberg (1979) discussed the correlation function p\(t) = c _A l*l . 
The asymptotic optimal designs are given by 

fo, I*I<v^7m. 

P( H g -(M-rA* r ^<I*I<T, 
1 0, otherwise, 

where the quantities //, r are defined by 

= 1 | Jf 2 (t)Q a (l/P*(t))P*(t)dt 
V 2 7 + JP(t)p*(t)dt 
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T= 2^/ f 2 ^P*^ dt + J f(t)Q a (l/p*(t))p*(t)dt + J f 2 (t)Q' a (l/p*(t))dt, 

respectively (see Bickel, Hcrzbcrg and Schilling (1981)) and depend on the parameters A 
and 7 defined in (2.2). Some of these designs are shown in Figure 4, while the relevant 
parameters are given in Table 3, which also contains the efficiency of the uniform design. 
We observe that - in contrast to the case of long-range dependence - the uniform design 
is rather efficient, provided either that the parameter A is not too large or that 7 is not 
too small. 

We now compare asymptotic optimal designs derived under the assumption of a long- 
range dependence with asymptotic optimal designs under short-range dependence. In 
Table 4, we show the efficiency of a design derived under the assumption of short-range 
dependence, in the situation where the "true" correlation structure is a member of the 
SVF family. We observe that the loss of efficiency is only substantial if the parameter a 
is small. The opposite situation is displayed in Table 5, which shows the efficiency of the 
asymptotic optimal design under long-range dependence (from the SVF family), but the 
"true" correlation structure is in fact of exponential type. Again, the asymptotic optimal 
designs derived under the long-range dependence are rather efficient, except when the 
parameter a is very small. 



Table 3. Parameters of the asymptotic optimal design density for the linear regression through 
the origin, where the correlation function is given by px(t) = e" A ' ( ' (the last column of the table 
shows the efficiency of the uniform design (4.5)) 



A 


7 




T 




effuni 


()..-) 


0.5 


3.41 


0.32 


0.30 


0.89 


()..-) 


0.1 


9.82 


3.23 


0.57 


0.63 


0.5 


0.9 


2.38 


0.08 


0.18 


0.97 


0.1 


0.5 


12.70 


0.22 


0.13 


0.99 


2.5 


0.5 


1.45 


0.54 


0.61 


0.57 



Table 4. Efficiency of the asymptotic optimal design density for the correlation function p\{t) = 
e~ A '*' in the linear regression through the origin, while the "true" correlation function belongs 
to the SVF family 





Q 


0.05 


0.25 


0.50 


0.75 


0.95 


A = 0.5 


7 = 0.5 


0.62 


0.82 


0.96 


1.00 


0.97 


A = 0.5 


7 = 0.1 


0.81 


0.97 


0.99 


0.89 


0.77 


A = 0.5 


7 = 0.9 


0.53 


0.73 


0.90 


0.99 


1.00 


A = 0.1 


7 = 0.5 


0.50 


0.70 


0.88 


0.98 


1.00 


A = 2.5 


7 = 0.5 


0.81 


0.97 


0.98 


0.89 


0.77 
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Table 5. Efficiency of asymptotic optimal design density for a long-range dependence error 
structure in the linear regression through the origin, while the "true" correlation function is 
given by p x (t) = e _A|t| 



A 


0.5 


0.5 


0.5 


0.1 


2.5 


7 


0.5 


0.1 


0.9 


0.5 


0.5 


a = 0.05 


0.19 


0.40 


0.15 


0.15 


0.35 


a = 0.25 


0.69 


0.94 


0.59 


0.58 


0.93 


a = 0.50 


0.94 


0.98 


0.87 


0.86 


0.98 


a = 0.75 


1.00 


0.88 


0.98 


0.98 


0.85 


a = 0.95 


0.95 


0.73 


0.99 


1.00 


0.68 
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